18.05 Lecture 36 
May 11, 2005 



Review of Test 2 (see solutions for more details) 
Problem 1: 

¥{X = 2c) = i,P(X = ic) = i ^ EX = 2c(i) + ic(i) = |c 
EX„ = (|)"c 

Problem 2: 

^1) •••) Xn 

n = 1000 

P(Xi = l) = i,P(X, = 0) = i 

M = EX = i,Var(Xi)=p(l-p) = i 

= Xi + . . . + Xn 

P(440 < S'h < A;) = 0.5 



S'^ - nEXi Sn - 1000(1/2) _ Sn - 500 
v'nVar(Xi) ^ ^1000(1/4) ~ ^250 



^,440- 500 A: - 500, 
P( <z< ) = 0.5 



^250 



/250 



by the Central Limit Theorem: 



^,fc- 500, ^,440- 500, ^,A:- 500, „ ^,A:- 500, 
^( ^ ) - $( ^ ) = ^( ^ ) - $(-3.75) = $( ^ ) 



/250 



^250 



/250 



^250 



Therefore: 



Problem 3: 



$(^) = 0.5001 ^^ = 0,^ = 500 



/250 



0e« 



'250 



Easier to maximize the log-likelihood: 



e+1 



max 



log tj){e)=n \og{e) + nO - {6 + l)\ogY\_Xi 



e 



+ n - log = 



logH^j - n 



Problem 5: 

Confidence Intervals, keep in mind the formulas! 



X — c 



(x^ — x'^) < fi < X + c 



n — 1 



^ {x^ - x^) 



n — 1 



Find c from the T distribution with n - 1 degrees of freedom. 



Ill 





ci = 3,325 C2 =16,92 



Set up such that the area between ci and C2 is equal to 1 — a 
In this example, ci = 3.325, C2 = 16.92 

Problem 4: 

Prior Distribution: 



m 



/3° 



r(a) 



/(a;i,...,a;„|6i) 



Posterior Distribution: 



(n^.) 

Posterior = r{a + n, P — n + log fl Xi) 
Bayes Estimator: 



_ Qa+n-l^-p0+ne^-0losUxi _ Q{a+n)-l ^-{l3-n+logUxi)0 



a + n 



13 -n + logllxi 
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Final Exam Format 

Cumulative, emphasis on after Test 2. 
9-10 questions. 

Practice Test posted Friday afternoon. 

Review Session on Tuesday Night - 5pm, Bring Questions! 

Optional PSet: 

pg. 548, Problem 3: 

Gene has 3 alleles, so there are 6 possible combinations. 

Pi = elp2 = elp3 = {i-e^-e2f 

P4 = 29102, P5 = 20i(l -9i- 02), P6 = 2^2(1 -Oi- 62) 

Number of categories ^ r = 6, s = 2. 
2 Free Parameters. 

^{0^,02) = 02^i02W^(l -01- ^a)'"^^ (201^2)^^2^1(1 -Oi- 02))^'{292{l - 9i - 02))^« 

_ 2JV4 + JV5 + JV6^2iVi+iV4+iV5^2JV2+iV4 + JV6^-|^ — 01 — + + 

Maximize the log likelihood over the parameters. 

log V = const. + {2Ni +N4 + N5) log ^i + (2Ar2 + Ni + Ne) log 62 + {2N3 + N5 + Nq) log(l - ^i - ^2) 
Max over ^1,^2 
log tp = a 
logOi + b 

log92 + c 
log{l -9i- 92) 

da c d b c 

Wi~ fi' l-0i-02~ 'd9^~9^~l-9i-92~ 



Solve for 611,612 



a — aOi — a^2 — c6i = 0, a — aOi — bO\ — c^i = 

n 



a + b + c' a + b + c 
Write in terms of the givens: 



ei 



2Ni + N2 + N5 1 ^ 2iV2 + Ni + Ne 1 



2n 5' ^ 2n 2 



where n = '^Ni 
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Decision Rule: 

S = {Hi:T>c,H2:T<c} 

Find c values from chi-square dist. with r - s - 1 d.o.f. 
Area above c = a — > c = 7.815 



Problem 5: 

There are 4 blood types (O, A, B, 
There are 2 Rhesus factors (+, -) 
Test for independence: 



AB) 








A 


B 


AB 




+ 


82 


89 


54 


19 


244 




13 


27 


7 


9 


56 




95 


116 


61 


28 


300 



(82- 



244(95) - 
300 ' 



244(95) 
300 



Find the T statistic for all 8 cells. 

^ X(a-i)(6-i) = xi' ^^'^ the test is same as before. 

** End of Lecture 36 



+ ... 



114 



